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We provide an explicit example of a Bell inequality with 3 settings and 2 outcomes per site for which the 
largest violation is not obtained by the maximally entangled state, even if its dimension is allowed to be arbitrar- 
ily large. This complements recent results by Junge and Palazuelos (arXiv: 1007.3042) who show, employing 
tools from operator space theory, that such inequalities do exist. Our elementary example provides a simple, 
natural setting in which it can be explicitly demonstrated that even an arbitrarily large supply of EPR pairs is 
not the strongest possible nonlocal resource. 
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I. INTRODUCTION 

Entanglement is a powerful resource, facilitating compu- 
tation, communication, or more generally any nonlocal task. 
Like all resources it is useful to be able to measure it, so that 
entangled states could be ranked according to their usefulness 
for a given task. A very natural measure for the entanglement 
of any bipartite state £ T-L A ® "Kb is the entropy of entan- 
glement E{V) = S{ PA ) H, where S{p A ) = -Tx{p A \ogp A ) 
is the von Neumann entropy and p A = trg(|^) (^|) is the re- 
duced density operator of \^f) on one of the two subsystems. 
In any dimension d this measure is maximized by the maxi- 
mally entangled state 
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Since \^/d) exhibits the largest amount of entanglement, it 
would be natural to guess that it would indeed be the most 
useful state for any nonlocal task. This belief is reinforced by 
the fact that this state has proven extremely useful for many 
quantum information problems (e.g. J(| |H [3]), and is by it- 
self a sufficient resource for the creation of any other nonlo- 
cal state as soon as one allows local operations and classical 
communication (LOCC) JH. Moreover, it is known that any 
shared pure state \^) violates a Bell inequality if and only if 
it is entangled luOl 13411 . suggesting that the amount of entan- 
glement may play a central role in quantifying the strength of 
nonlocal correlations. 

For a long time it was implicitly assumed that \^d) is the 
most useful state with respect to violation of Bell inequali- 
ties 0J, The first doubts cast on this conjecture stem from 
a result by Eberhard jlTil who showed that when it comes to 
closing the detection efficiency loophole less entangled states 
can be more useful. More recently, such doubts were con- 
founded by the surprising fact that, at least in small dimen- 
sions in which numerical experiments can be conducted, there 
are inequalities for which the maximally entangled state does 
not give the maximum violation. More specifically, for every 
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dimension d there is a Bell inequality (such as the CGLMP 
inequality |fl5IO which in that dimension is maximally vio- 
lated by a state different from the maximally entangled state 
— a state with lower entanglement U 0, l37ll . Conversely, 
unlike for the maximally entangled state, even an infinite sup- 
ply of certain maximally non-local resources does not allow 
us to simulate all possible correlations coming from some less 
entangled states iTllfl . This has prompted the realization that 
nonlocality might be a resource of a different nature than en- 
tanglement, and many other examples have been discovered 
in the realm of Bell inequalities and quantum cryptography 
(see J27P for a survey), as well as quantum information the- 
ory IlZIIM]- Since Bell inequalities provide a very natural set- 
ting in which the properties of entanglement manifest them- 
selves, it is interesting to ask whether an arbitrarily large sup- 
ply of EPR pairs could indeed not be a sufficient resource for 
their maximal violation. In a recent paper, Junge and Palazue- 
los l2lll provided a negative answer to this question. In par- 
ticular, they showed using tools employed in the study of op- 
erator algebras and a probabilistic argument that there exists a 
family of Bell inequalities for which the maximum violation 
cannot be obtained by using the maximally entangled state, 
even if its dimension d is arbitrarily large. However, they do 
not provide any explicit such inequality, and moreover their 
results only hold in an asymptotic sense, so that it is not im- 
mediately clear what is the size of the smallest Bell inequality 
for which this phenomenon can be observed. 



A. Result 

In this note, we provide a simple example of an extremal 
Bell inequality that by itself already demonstrates that it is 
sometimes necessary to use less entanglement in order to ob- 
tain more nonlocality. Unlike the results of ll2lll we use only 
elementary techniques. 

The inequality we consider involves measurements on only 
two sites, Alice and Bob, where each party has only three 
measurement settings and two outcomes. This inequality is 
thus one of the simplest possible examples of a Bell inequal- 
ity where an arbitrary amount of EPR pairs could really make 
a difference. First of all, it is known that any inequality with 
only two settings and two outcomes per site can be maxi- 
mally violated when Alice and Bob each hold only a single 
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qubit HfJ. In particular, this means that if one were merely 
interested in knowning whether the maximally entangled state 
is necessary to obtain the maximum possible violation it suf- 
fices to check states in a fixed dimension, where it is already 
known that the maximally entangled state is not always the 
best. Second, however, the inequality we consider is also the 
first extremal Bell inequality after the CHSH inequality |[l2ll . 
The CHSH inequality is particularly fundamental in that vio- 
lation (by a certain state) of any two-setting inequality implies 
the same state also violates CHSH. Moreover, it is known that 
the CHSH inequality is maximally violated by a single EPR 
pair. When one allows three settings, the only new indepen- 
dent inequality (i.e., which can be violated by a state not vio- 
lating the CHSH inequality) is the one we consider lfl4ll . 

More specifically, the inequality we consider is the well- 
known -Z3322 inequality, first introduced in lfl9ll . We will use 
{A?}.je{i,2,3} an d {-^fc}fce{i,2,3} t° denote the measurement 
operators for the first of the two possible outcomes for Alice 
and Bob respectively. Using the common shorthands 

(AjBk) := WAj ® B h \¥) , (2) 
(Aj) := WAj ® id|*) , (3) 
(fl fc >:=<tt|id®B fc |4), (4) 

we define 

(/ 3 322> := - (A 2 ) - (Bi) - 2(B 2 ) + (A 1 B 1 ) 

+ (AxB 2 ) + (A 2 B 1 ) + (A 2 B 2 ) - (AxB 3 ) 
+ (A 2 B 3 ) - (A3B1) + (A 3 B 2 ) (5) 

While for classical correlations we have 

(i"332 2 ) < , (6) 

there exist measurements lfl4Tl such that using just one EPR 
pair (i.e. \ty) = ^2)) one can get 

(/ 3 322> = \ ■ (7) 

Yet, the precise maximum of (13322) over all quantum states 
and measurements remains unknown. Numerical upper- 
bounds were obtained using a SDP hierarchy in fl6ll . This was 
followed by recent exhaustive numerical investigations by Pal 
and Vertesi Bill , who report very interesting results. Their 
experiments suggest that the optimum violation of even 
though it only involves a constant number of settings and out- 
comes, might only be reached in infinite dimension. Indeed, 
they find strategies obtaining a value of at least 0.25084... 
(matching the upper bound up to precision 10~ 7 in dimension 
~ 100), and moreover in their experiments this value keeps 
increasing as the dimension of the strategies is allowed to in- 
crease. Moreover, even though the observables which achieve 
the maximum violation in a given dimension have a rather 
simple and systematic form, the corresponding state has an in- 
teresting distribution of Schmidt coefficients, and it is quite far 
from the maximally entangled state. Hence their results sug- 
gest a simple inequality for which the maximally entangled 
state of any dimension may not permit the largest violation, 



and moreover that the maximal violation cannot be attained 
with any finite-dimensional state. 

Here, we prove that indeed the maximally entangled state 
does not lead to the optimal violation of even such a simple 
inequality. That is, 

Theorem 1. For all dimensions d > 0, and any observables, 
using the maximally entangled state = \^&d) can lead to a 
violation of at most 

(/3322> <\- (8) 

Note that in contrast with previous work, (Q tells us that 
a value of 1 /4 can be attained using just one EPR pair, and 
hence the maximally entangled state in any dimension is no 
more powerful than the maximally entangled state for d = 2. 
This definitively demonstrates, in the simplest possible set- 
ting, that maximally entangled states are not the most nonlo- 
cal. 



B. Generic states 

Before embarking on our proof, it is worth pointing out 
that there does in fact exist a generic family of states that 
always allow us to obtain the maximum violation for any 
Bell inequality. These states, however, exhibit less entangle- 
ment than the maximally entangled state of same dimension. 
This "universal" family of states are known as embezzlement 
states i36ll . They previously played an important role in more 
involved tasks in quantum information theory, namely the so- 
called quantum reverse Shannon theorem J7[01, which pro- 
vided another example where the maximally entangled state 
is not sufficient to achieve the corresponding channel simula- 
tion result, but the universal embezzlement states are. The key 
property of the <i-dimensional embezzlement state that is 
used is that, for any pure state \*ff), there exists d and d! such 
that w \&d') ® |^)> where the equivalence only requires 
the application of local unitaries on each system; no commu- 
nication is needed plill . Since an embezzlement state can be 
used to obtain any other pure state by local unitary operations, 
it immediately follows that any Bell inequality can be maxi- 
mally violated by an embezzlement state (of possibly higher 
dimension), as pointed out recently in l30ll . This demonstrates 
that, even though in small dimensions it might seem like ev- 
ery inequality has its own specialized maximizing state, if one 
allows the dimension to grow larger, then a simple class of 
states is sufficient to obtain maximal violations. 



II. USING THE MAXIMALLY ENTANGLED STATE 

We now give a detailed overview of the proof of our main 
result (Theorem [T), relegating technical details to the ap- 
pendix. Throughout we will refer to a particular choice of 
measurements applied to the maximally entangled state as a 
strategy. Since our game is binary, it is known that we may 
assume without loss of generality (and without affecting the 
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underlying state) that the operators used by Alice and Bob are 
projectors |fl3l Proposition 2], and we will denote them by 
{Aj, id - Aj} for Alice and {B k , id - B k ) for Bob. We will 
also refer to 

W := (/3322> (9) 

as the value of a particular strategy. Our goal is to show that 
ui is at most 1/4, irrespective of the dimension d. We first 
introduce an important tool in our analysis, the CS decompo- 
sition of a pair of projectors. This decomposition was also at 
the heart of the results in 12611 . where it was used to handle the 
case of only two observables per site. 

The CS decomposition. Given a pair of d-dimensional 
projectors P and Q, there exists an orthonormal basis in 
which the two projectors are jointly block-diagonal (see 
for instance fioll ). Moreover, the blocks can be either 1- 
dimensional, in which case P and Q either have a or a 1 
in that block, or 2-dimensional, in which case they can be 
written in the form 

for some coefficients c E (—1,1) and s = Vl — c 2 . The 
angles 9 such that c = cos 9 are called the principal angles 
between the subspaces on which P and Q project. 

Our proof proceeds in two steps. Step 1 is to show that 
we can greatly simplify the form of Alice's and Bob's mea- 
surement operators. The main idea is to show using the CS 
decomposition that for any strategy maximizing (0 there ex- 
ists a basis in which all measurements are tridiagonal l38ll . 
This lets us greatly reduce the number of parameters and give 
a relatively simple analytic expression for the value ui of the 
strategy. Step 2 consists in upper-bounding this simple ex- 
pression using standard analytic techniques. 



A. Step 1: A simple joint normal form 

This is arguably the most crucial step in our proof, as it lets 
us show that a completely arbitrary strategy given by projec- 
tors {Aj -, Bk} 3 t k=i ,....3 can be put into a much simpler form 
without decreasing its value. As we mentioned previously, 
the key idea is to apply the CS decomposition twice, once 
to the pair (Ai,A 2 ), and once to the pair (Bi,B 2 ). This 
results in two orthonormal bases Ba and Bb such that the 
matrices of (Ai,A 2 ) in Ba are block-diagonal, with blocks 
of the form ( fTOb for A\ and ( fTTT ) for A 2 , and similarly for 
(Bi,B 2 ) in Bb- We number the blocks of (Ai,A 2 ) using 
even indices 2, . . . , d and call the corresponding coefficients 
C2i,S2i', the blocks of (Bi,B 2 ) are numbered using odd in- 
dices 1, . . . ,d+l and corresponding coefficients c 2 i + i, s 2 i + i. 

In general the bases Ba and Bb are unrelated, but we argue 
that, under the condition that the strategy maximizes (0, they 



must in fact be permutations of one another. To see this, note 
that (0 can be re-written as 

(^3322) = 

{A 1 + A 2 ,B 1 + B 2 ) + (A 2 - A u Bz) + (A 3 , B 2 - B x ) 
- {A 2 , id) -(id, B x ) -2(id,B 2 ) (12) 

where 

(A,B) = -Tt(A t B) (13) 
d 

and we used that if is the maximally entangled state then 

(¥|Ag)B|¥> = (A,B) . (14) 

Note that since the Aj operators always appear on the left 
of the tensor product (Alice's side), we will henceforth ar- 
gue about A J rather than Aj, omitting the transpose sign for 
simplicity of notation. For the moment, let's ignore the con- 
tribution of the last three terms in ill 2b . Observe that A3 
(resp. B3) only appears in the term (A 3 ,B 2 — Bi) (resp. 
(A 2 — A\, B 3 )). When maximizing over A 3 it is thus clear 
that the optimal choice is to make A3 the projector onto the 
positive eigenspace of B 2 — B\ (resp. B 3 to project on the 
positive eigenspace of A 2 — A\). This in particular implies 
that the value of those two terms is independent of the choice 
of Bb (resp. Ba)- Hence the choice of the bases Ba, Bb only 
bears influence on the value of the first term in ( TT2b . 

Let us now examine the first term. Note that the precise 
form ( [Tol l, ( fTTT i in which we wrote the CS decomposition en- 
sures that A\ + A 2 is diagonal in Ba (resp. B\ + B 2 in 
Bb)- It is well known (see Claim [6] in the appendix) that 
(Ai + A 2 , B\ + B 2 ) is maximized whenever the vectors in 
8b are a permutation of those in Ba- It follows that for the 
optimal choice of bases A\ + A 2 and B\ + B 2 will necessarily 
be simultaneously diagonal. 

However, this does not necessarily imply that the blocks of 
(Ai,A 2 ) are aligned with those of (Bi,B 2 ), as correspond- 
ing pairs of basis vectors need not match — in fact, if they 
did, then it is not hard to see that the strategy would be re- 
duced to a convex combination of 2-dimensional strategies, 
which would conclude our proof. Nevertheless, by a sim- 
ple argument we can show that without loss of generality 
the blocks are simply "shifted": there exists an ordering of 
Ba = {ei, . . . , ed} such that if the blocks of (Ai,A 2 ) cor- 
respond to pairs (ei, e 2 ), (e 3 , €4), . . . then those of (Bi, B 2 ) 
can be seen to correspond to pairs (e^, ei), (e 2 , e 3 ), . . .. 

The exact form we obtain for the strategies is given in Def- 
inition|4]in the Appendix, and gaps in the argument above are 
filled in the proof of Lemma|5] which can informally be sum- 
marized as follows. 

Lemma 2 (Lemma informal). There exists a basis 
(ei, . . . , e c i) in which 

• (A\, A 2 , B3) (resp. (£?i, B 2 , A 3 )) are jointly block- 
diagonal. 

• The blocks corresponding to each of these decomposi- 
tions are shifted: blocks of (A\ , A 2l B3) correspond to 
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pairs ( e2i-i, e 2i), w/w/e blocks of (A±, A2, B3) corre- 
spond to pairs (e 2 i, e2i+i). 

• 77ie blocks of (A\, A2) are of the form d 1 Ob . (II 11 1 w/f/1 
coefficients (c2j, S2i)> * = 1 , - ■ - , cZ/2, w/zde f/zose 0/ 
(Si, B2) are of the same form with corresponding co- 
efficients (cst+i, s 2 j+i), i = 0, . . . , d/2 - 1. 

B. Step 2: The value of a strategy in joint normal form 

Once we have found a nice basis in which to express all 
observables appearing in the strategy, it should appear as no 
surprise that the value of (0 should be easily expressible as 

a function of the coefficients (cj)j=i d, since these are the 

only free parameters left in our choice of strategy. In fact, 
fixing coefficients q where i is even, it is not hard to determine 
the optimal choice of coefficients Cj for odd i. This reduces 
the size of our problem to the d/2 parameters c 2 , . . . , c^. One 
can then show that the strategy has the following value (cf. 
Lemma[8]for a more precise statement): 

1 d/2 _ 
W = -^/(c 2 i-l,C2i+l) + ^ + , (15) 

2 — 1 

where 



f(x,y) = v/(z + y)2 + 1 + - y/l - + - y/1 - if - 2 . 

We have thus rephrased the problem of maximizing (13322) 
over all strategies in terms of maximizing ui over all admis- 
sible coefficients (c2j-i)j=i d/2+i- To prove our claim, it 
only remains to prove an upper bound on oj, which can be 
done using standard analytical techniques provided in the ap- 
pendix. 

Lemma 3. Let c^i-i € [— 1, l],/or i = 1, . . . , d/2 + 1. Then 
the expression u> = uj{c%) in (1151 ) is upper-bounded by j. 

III. CONCLUSION AND OPEN QUESTIONS 

We have provided a concrete example of a simple inequal- 
ity for which it can be shown that the maximally entangled 



state of any dimension is not the most nonlocal state. An in- 
teresting question, already asked in (3111. is whether one can 
show that optimal violation of the ^3322 inequality requires 
a state of infinite dimension. This is strongly suggested by 
the strategies found numerically by Pal and Vertesi, which, 
even though they are based on an entangled state which is 
very far from the maximally entangled state, have a matrix 
form which is quite similar to the one in Def. [4] Extending 
our argument to show that Alice and Bob's measurements al- 
ways have this form, even when they do not use the maximally 
entangled state, would be a big step towards proving that no 
finite-dimensional strategy is optimal [3]. This would not only 
have very interesting consequences for our understanding of 
Bell inequalities, but also for the optimization of polynomials 
with non-commutative variables. In particular, it would imply 
that the SDP hierarchies suggested in llq, l28l 12911 only con- 
verge in the limit of infinitely many levels, which is an open 
problem even outside the realm of quantum information. 

Note: After posting our work to the arXiv, we learned about 
related work by Liang et al. (24ll . which by now has also 
appeared on the arXiv. In a different context and indepen- 
dently of our results, they found a two-setting, two-outcome 
linequality for which the maximally entangled state of any di- 
mension is not the most nonlocal. Recall, however, that in that 
case it is already known that local dimension 2 is sufficient for 
both Alice and Bob, and hence it suffices to determine whether 
maximal entanglement in that fixed dimension is necessary to 
obtain the maximum violation. The motivation for their work, 
however, is in a different context in which such inequalities 
are indeed interesting. 
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Appendix A: A joint normal form for strategies using the 
maximally entangled state 

The goal of this section is to prove Lemma[5] which shows 
that any optimal strategy must have a certain simple joint nor- 
mal form. Before we define it precisely, note that in order 

for the strategy {Aj, Bk}j.k=i 3 to be optimal, for a fixed 

choice of {Bk} it is necessary that the operators {Aj} be cho- 
sen so as to maximize 

(*|i4i®(Bi+B 2 -B 3 )|*) (Al) 
(y\A 2 ®(B 1 +B 2 +B 3 -\d)\'$) (A2) 

(*|i4a®(B 3 -Bi)|*) (A3) 

while for fixed {A,-}, the {Bk} should maximize 

(*\B 1 ®(A 1 +A2-As-\d)\V) (A4) 

(*|B 2 ® (A 1 +A 2 +A 3 -2id)|*) (A5) 

(*|fl 3 ® (A 2 - A{)\*) (A6) 

Since |^) is the maximally entangled state, for any A and 
B we have ($\A <g> = ±Tr(AB T ) =: (A, B), where 

(-, •) denotes the real Hilbert-Schmidt matrix inner product. 
To simplify notation, and since the Aj operators always ap- 
pear on the left of the tensor product (Alice's side), we will 
argue about Aj rather than Aj, omitting the transpose sign. 
Hence given for instance B\,B 2 and B 3 , the A\ maximiz- 
ing dAU is simply the projector on the positive eigenspace of 
B\ + B 2 — B 3 . In particular, if B\, B 2 and B 3 have a joint 
block-diagonalization this will be reflected in B\ + B 2 — B3 
and hence in Ai . This observation, combined with the CS de- 
composition for a pair of projectors, will let us find a simple 
joint form for all the Aj and B^, as explicited in the following 
definition. 



Definition 4. For any c € [— 1, 1], let s = yl — c 2 and define 
the 2- dimensional projectors 



Pi(c) ■= 

P3 ■■= 



1/l-c -s 

2 \ s 1 + c 

1/l-c s 

2\s 1 + c 

1/1 1 
2 U 1 



(A7) 
(A8) 
(A9) 



We say that d- dimensional projectors {Aj, Bk} are in joint 
normal form if there exists a basis ofC d such that either 
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• For even dimensions d, there exist reals a G [— 1, 1], 
i = 1, . . . , d + 1 such that: 

- A\ ( resp. A 2 j is block-diagonal with blocks L\ l = 
Pi(c 2i ) (resp. Lf = P 2 (c 2l )), i = 1 . . . d/2 

- B3 is block-diagonal with blocks all identical to 
P 3 - 

- B\ (resp. B 2 ) is block-diagonal, with the first 
block R\ (resp. R\) one-dimensional equal to 
(i^i), the following d/2 - 1 blocks R\ i+1 = 
Pi(-c 2i +i) (resp. Rf +1 = P 2 {-c 2l+1 )), 1 = 
l...d/2-l, and the last block R* +1 = ( 1 ~ c 2 d+1 ) 
(resp. R d 2 +1 = C-^)). 

- A3 is block-diagonal with its first block one- 
dimensional equal to (1), the following blocks 
all identical to P3, and the last block one- 
dimensional equal to (1). 

• For odd dimensions d, there exist reals Ci € [—1, 1], 
i = 1, . . . , d+ 1 such that: 

- A\ (resp. A 2 ) is block-diagonal with (d — 
l)/2 2-dimensional blocks Lf = P\(c 2 i) (resp. 
Lf = P 2 (c 2i )), i = 1 ... (d - l)/2, and a fi- 
nal 1-dimensional block L d+1 = (- — ^ ±i ) (resp. 
L d+i = (iz£i±i)j ; 

- B3 is block-diagonal with the first (d—l)/2 blocks 
all identical to P3, and the last one 1-dimensional 
equal to (1). 

- B\ (resp. B 2 ) is block-diagonal with an ini- 
tial one-dimensional block R\ = ( 1 ~ 2 Cl ) (resp. 
R\ = i 1 ^)) and the following (d - l)/2 
blocks Ri + = Pi{—c 2 i+i) (resp. R 2 l+ = 
P 2 {-c 2l+1 ),i = l...(d-l)/2. 

- A3 is block-diagonal, with the first 1-ditnensional 
block equal to (1), and all following blocks iden- 
tical to P3. 

Or the same as above, but with the roles of {A\, A 2 , B3} and 
{B\, B 2 , A3} exchanged. 

The main lemma of this section is the following: 

Lemma 5. Suppose Ai,A 2 , A3 and B\, B 2 , B3 are six d- 
dimensional projectors achieving the maximum of © over all 
d-dimensional strategies using the maximally entangled state 
l^i). Then there is a d! < d, and a d' -dimensional strategy in 
joint normal form which achieves a value at least as large as 
thatof{Aj,B k }. 

Proof. Apply the CS decomposition to A\ and A 2 , resulting in 
a joint block-diagonalization basis {|ej)}i, and to B\ and B 2 , 
resulting in {\fi)}i- We first show that we may take {|e,)} — 
{\fi}} without lowering the value of the strategy. 

As we already noted, the optimal choice for A3 (resp. B3) 
is the projector on the positive eigenspace of B 2 — B\ (resp. 
A 2 — A\). This implies that the value of (IA31 > does not depend 



on the choice of basis {|ej}}, but only on the eigenvalues of 
B 2 — B\. Hence of all the terms in Q, the only ones whose 
value depends on the choice of the bases {|ej)} and can 
be grouped together as (*|(Al + A 2 ) <g> (Bx + B 2 )|*)- 

Claim 6. Let |*) = and A = Ei«iKX u i| 

and B = J^. Pi\vi)(vi \ positive. Then the expression (^\A ® 
B\ty) is maximized when the \ui), \vi) are a permutation of 
the Schmidt basis of \ 

Proof. For any two matrices A, B we have (fy\A ® B^) = 
2Tr(A T B). Note that A T has the same eigenvalues as A. We 
then have by 1I22I Lemma IV. 1 1 ] that there exists a permuta- 
tion 7r e Sd such that 

\tr(A T B)<Y,^f , (MO) 

3=1 

where A^ 1 , . . . , and Af , . . . , Af are the eigenvalues of A 
and B respectively. □ 

Given our specific choice of basis for the block- 
diagonalization, we have that A\ + A 2 (resp. B\ + B 2 ) is di- 
agonal in the basis { | ) } (resp. hence Claim|6]shows 
that these two bases may be taken equal (up to permutation) 
without lowering the value of the strategy. 

We call a strategy given by projectors {Aj, B k }j,k irre- 
ducible if it cannot be decomposed as a direct sum of lower- 
dimensional strategies. We show that any irreducible strategy 
has the form described in Definition^ 

Claim 7. Suppose {Aj , Bj } is irreducible. If d is even, then 
either all blocks of the joint decomposition of {Ai, ^2,-83} 
and {B\ 1 B 2 , A3} are two-dimensional, or {Ai, 83} 
have exactly two 1-dimensional blocks and {B\, B 2 , A3} 
none (or vice-versa). If d is odd, then each of {A\ 1 A 2l B3} 
and {Bi, B 2l A3} have exactly one common 1-dimensional 
block. 

Proof. We treat the case of even dimension, the odd- 
dimensional case being analogous. Reason by contradiction 
and first assume e.g. that {Ai,A 2 , B3} each have more than 
two 1-dimensional blocks in their joint block-diagonalization. 
We show that there is a non-trivial subspace stabilized by all 
operators {Aj, B k }, contradicting the strategy's irreducibility. 

Let I ei ) be the vector corresponding to a one-dimensional 
block of {Ai, A 2 , B3}. Since the are a permutation 

of {|ei)}, there exists an i\ such that \fi 1 ) = |ei). There 
are two possibilities for 1/^): either it is a joint eigenvector of 
Bi , B 2 and A3 (i.e. it corresponds to a one-dimensional block 
in their joint block-diagonalization), or there exists an index 
i 2 such that Spanjl/^), \fi 2 )} is left invariant by the action 
of Bi,B 2 and A3 (i.e. it corresponds to a two-dimensional 
block). In the first case we have already found a strict sub- 
space Span{|ei)} stabilized by all {Aj, In the second 
case we can iterate this procedure, assuming without loss of 
generality that \e 2 ) = \ fi 2 ). There are again two cases: either 
\e 2 ) corresponds to a 1-dimensional block of {Ai, ^2,-63}, in 
which case Span{|ei), \e 2 )} is a non-trivial stable subspace, 
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or there is a vector |ea) such that (|e2), |e3)) corresponds to a 
2-dimensional block of {Ai,A 2 , B 3 }. We will then find an i 3 
such that |/jg) = 1 63), and so on. 

In all cases, the process must end as soon as one of the vec- 
tors \ek) encountered corresponds to a 1 -dimensional block 
of {Ai, A2, B3}. Given our assumption that there were three 
or more such blocks, we have found a strict subspace stabi- 
lized by all {Aj, Bk}, contradicting the irreducibility assump- 
tion. □ 

As a consequence of Claim|7l we can block-diagonalize the 
pair of projectors {A\ , A2) with blocks 



t 2i _ 1 A - c 2i S2i 

1 2 I -s 2i 1 + c 2l 



L'i 



1 fl~ C 2 i S 2 i 

2 V S 2i 1 + C 2l 



(All) 
(A12) 



where c 2 ; £ ( — 1,1) and s 2 j = yl — c^, together possibly 
with an initial and final 1-dimensional blocks, depending on 
the parity of the dimension. 

In the definition of a normal form we also require the one- 
dimensional blocks to have the same coefficients for both Ai 
and A2, which is is easily seen to hold without loss of gen- 
erality from the optimality of the strategy {Aj, Bk}- Indeed, 
let i be the index of such a block, corresponding to vector 
|e.;); Ai and A2 are necessarily chosen so as to maximize 
the value of dAll i and iA2i respectively, and the coefficient 
in front of (Ai)i^ and (^2)1,; will be the same in both equa- 
tions, so that the optimal choice is the same. Similarly, the 
matrices (Bi , B 2 ) can be block-diagonalized with blocks: 



2i+l 



R 



2i+l 



+ C 2l +l -S 2 i+1 

2 V -S2i+1 1 - C 2i+ 1 

1 (l + C 2l +1 S 2 i+1 

S2i+1 1 — C2i+1 



(A13) 
(A14) 



Finally, it is easy to infer from dA3b (resp. iA6i ) the necessary 
form of A3 (resp. B3): indeed, it is simply the projector on 
the positive eigenspace of £> 2 — B\ (resp. A2 — A{), which is 
a block P3 whenever B±, B2 (resp. ^4i, A2) have a common 
2-dimensional block, and a block (1) whenever B\,B 2 (resp. 
A\ , A2) have a common one-dimensional block. □ 



Appendix B: The value of a strategy in joint normal form 

In this section we derive an expression for the value ob- 
tained in © for any strategy in joint normal form (Lemma[8]), 
and then show how it can be upper-bounded by analytical 
techniques (Lemma [Tot. 

Lemma 8. Suppose {Aj, Bk} is a strategy in joint normal 
form, described by a certain block structure and correspond- 
ing sequence of coefficients C{. Then the value of ©/or this 
strategy for even dimensions d is given by 



d/2 
i=l 



-l,C 2 j+l) + 



ci - C d+ l 

2d 



(Bl) 



and for odd dimension d by 

(d-l)/2 

W= ~j ^2 /( C 2i-l= c 2i+l) 



(B2) 



i=l 

1 / Ci - C d+1 1 

1 + -a/1 - c:, 



+ dK CdCd+1 

where 

f{x,y) 

= ^(x + y y + l + ^l-x 2 + ^l-y 2 -2. (B3) 

Proof. We treat the cases of even and odd dimension sepa- 
rately. 

a. d even. In that case we know that the block- 
diagonalization of either {Ax, A 2 , B3} or {B\, B2, A3} con- 
tains exactly two 1-dimensional blocks, while the other 
contains none. We assume that {B\, B2, A3} has no 1- 
dimensional blocks; the other case is treated symmetrically. 
In this case we can write 



A, = 



B 2 = - 



- c 2 




s 2 












S2 


1 


+ C 2 




















1 


- c 4 




S4 












s 4 


1 


+ C4 


- C\ 




















1 


+ c 3 




S3 












S3 


1 


- c 3 




















1 


+ c 5 



V 



(B4) 



(B5) 
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where A\ and B\ are identical to A2 and B2 respectively 
but have their off-diagonal elements negated, and c\, Cd+i G 
{-!,!}■ 

Fixing the coefficients of B\ and B 2 , we can derive con- 
straints on those of A\ and A2 from the constraint that they 
should be chosen so as to maximize ( IAU and ( IA2t . The two 
equations are similar; let's look at iA2i . Its value can be cal- 
culated as 

\ Y,( A ^ + (Biki + (Bzki - hi) 



1 -j 



d/2 



-5E(5( 1 -«)« 1 -«-') + i-i) 

+ -(l + c 2i )((l + c 2i+ i) + --l)) 

d/2 

d^ 2 



d/2 



-y 



1=1 

, l + c 2i 



1 - C 2i I 1 

2 



C-2i-\ 



C2i+1 



£2, 

2 



(B6) 
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Setting r 2i = (c 2 i_i + c 2 i+i)/2, for a fixed C2i_i,c 2 i+i the Claim 9. Let f(x,y) = yj(x + y) 2 + 1 + \/l - x 2 /2 + 
choice of c 2 i which maximizes ( |B6t is c 2i = 2r 2 ^ (4t|^ + 

l)- 1 / 2 , which gives a value ofjE^ v 74 ^ + V 2 + V 2 + 
(c2i+i — C2j-i)/2 for JA2b . d A 1 1 ) is maximized for the same 
choice of coefficients, and has exactly the same value. Con- 
cerning dA3l ). we find that its value is simply 



d/a-i 

^^(^((^•-(50^) = - x 

i= 1 



S2i-\-l 



(B7) 



Combining ( 1A1UA2I ) and dA3K and subtracting 
(l/d)(Tr(Bi) + 2Tr(B 2 )), we obtain the value of ©, 
which is thus 

1 d/2 

U = - (V ( C 2>-1 + C2i+l) 2 + 1 + 1 
i=l 

d/2-1 



i— 1 



-2-t+l 



Crf+1 ~ Cl 

2d 



(B8) 



where we replaced S2«+i = y 1 — c 2i+i- Using the definition 
of /, this can be re-written as 



x d/2 

2 E /( C 2i-l! c 2i+l) + 
i— 1 



Cl - C d+ i 

2d 



b. d odd. In that case, each of {Ai,A 2 ,B3} and 
{Bi, B 2 , A3} must have a 1 -dimensional-block in their joint 
block-diagonalization; say that the one for {A 1: A 2 ,B 3 } is 
the last block while the one for {B\ , B2 , A3} is the first block. 
We can proceed exactly as above to evaluate the value of this 
strategy, under the condition that it is optimal and hence max- 
imizes (IAlt - dA3b . which lets us express the even coefficients 
C2i as a function of the odd ones C2j+i. Omitting a few cal- 
culations very similar to the ones we performed in the even- 
dimensional case, we obtain that the value of this solution is 



(d-l)/2 



2 J2 (V(c2i-l+C2i+l) 2 + l + l) +i(c d -Ci) 



i=l 

1 , w l 

+ -(l-c d+1 )(--c d ) 

(d-l)/2 



2 — 1 X 



3 S v+ 



(d-l)/2 



(BIO) 



i=l 



5 ( 



. Cl - C d+ l 

CdCd+i H 1 



□ 



It now remains to bound u. The following claim, proven in 
Section|C] will be useful. 



i/l — y 2 /2 — 2 be defined on [— 1, l] 2 . 7%en 

7. TTze maximum of f(a, b) + f(b, c) over a/Z a,b,c G 
[— 1, l] 2 smc/i that a + 6 > anc/ + c < is less 
than .244. 

2. The maximum of f(l,b)+f (b,c) over all b,c G [— 1, l] 2 
5mc/z f/iaf 1 + 6 > anc/ + c < Oii /ess f/zan .103. 

3. The maximum of f (a, 1) over all a G [— 1, 1] is less than 
.368. 

Lemma 10. Let Ci G [— 1, 1], for i = 1 . . . d + 1. 77ien f/ie 
expression u> = u;(cj) in foof/; (IB 11 1 ana! (IB2b is upper-bounded 

by I- 

Proof. First note that the maxi mum va lue of the expression 

CrfC rf+ i + Cl +^ +1 - 1 + jy/l - c 2 , over all ci,c d+ i G 



Cl 2 ^ d+1 is upper-bounded 
..,c d ) G [— 1, l] d_1 and 



{-1,1} and Cd G [—ljl] is less than 1/4, hence OijZD is 
always lower than dBU . Hence it is sufficient to show that 

u = 3-Ei=l/( c 2i-l)C2i+l) "+ 

by 1/4, for any even <i and (02 
ci,c d+1 G {— 1, 1}. 

It is easy to verify that f(x, y) < 1/2 on the square (x, y) G 
[—1, l] 2 . Unfortunately, the extra term C1 ~ 2d d+1 potentially 
induces an additive 1/d, so that it is not so immediate to bound 
uj. Note that we can assume that c% = 1 and c</+x = — 1, since 
otherwise the bound follows trivially from the upper-bound on 

f{x,y) < i/ 2 - 

Given the value of c\ and c^+i, there must exist an i such 
that c 2 j_i + c 2 i+i > and c 2i +i + c 2i+3 < 0; let i be the 
first such i. We distinguish four cases, depending on the value 
of i . 

• If d = 4, one gets that /(l,c 3 ) + /(c 3 ,-l) < 0. 
Adding (ci — Cd+i)/8, one can see that ui < 1/4. We 
assume d > 4 for the remaining cases. 

• If io = 1, we can use the second bound in Claim [9] 
to bound f{ci,cs) + f (03,05) by .103, since c\ 

I In this case the value of f (01,03) + f (03,05) + 
f(cd~i,Cd+i) is at most .103 + .368 < .5. Adding 
1 = (cx — Cd+x)/2 and dividing by d, we see that 
u < 1/4 irrespective of the value of the other Cj (re- 
call that f(x, y) < 1/4 for all (x, j/)). 

• If io = d/2 — 1, the same bound can be obtained by 
symmetry. 

• Otherwise 1 < iq < d/2 — 1, in which case by using 
the first and last bounds from Claim [9] we see that the 
value of f(a , c 3 ) + f(c2i-i, c 2l+ i ) + f(c 2 i+i, c 2l+3 ) + 
/(cd_x,Cd + x) is at most .244 + 2 ■ .368 < 1. Again 
adding 1 = (c\ — Cd+i)/2 and dividing by d, one sees 
that io < 1/4 irrespective of the value of the other c^. 



□ 
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Appendix C: Details of Claim|9] 

We now provide the details of Claim|9] To find the claimed 
upper bounds we use a well-established optimization tech- 
nique based on a hierarchy of semidefinite programs (SDPs) 
backed by the real Positivstellensatz l32l l33ll . More specifi- 
cally, if t denotes a claimed upper bound, our goal will be to 
show that for any variables a, b and c satisfying the constraints 
we have t — h(a, b, c) > 0, where h(a, b, c) denotes the func- 
tion we wish to optimize in case 1, 2 or 3. To this end, we will 
first rewrite any terms involving *J~- in the function h(a, b, c) 
in terms of additional variables. Second, we will use poly- 
nomial optimization techniques from ll32l l33ll to obtain the 
bound t. This is exactly analogous to the techniques estab- 
lished in quantum information to obtain bounds on quantum 
violation of Bell inequalities (Mill HI. 

We would like to emphasize that whereas semidefinite pro- 
gramming, as for example performed in Matlab, is a numeri- 
cal technique, if a bound tg is obtained at level £ of the SDP 
hierarchy then it is in principle possible to extract an analyt- 
ical proof that tg is an upper-bound on the corresponding ex- 
pression h from the numerics. That is, we do not rely on any 
heuristic optimization methods that are not guaranteed to pro- 
vide a rigorous bound. 



1. Case 3 

For completeness, we provide a brief informal sketch of this 
method for case 3; details can be found in l32l[33ll. or in the 
dual view of the SDP, as explained in this survey J23t] . First of 
all, substituting 



(a + l) 2 + l 



2a + 2 , 



2 -i 2 

z := 1 — a 



(CI) 
(C2) 



our goal of showing that t = 0.368 is an upper bound to 
/(a, 1) can be restated as showing that 

we have t > x + \z — 2 
whenever x 2 = a 2 + 2a + 1 

z 2 = 1 - a 2 

-1 < a < 1 . 

For simplicity, we will without loss of generality ignore the 
last constraint. Now note that if we were able to find polyno- 
mials ti and t 2 in variables x, z, and a such that 

p := t - (x + ^z - 2^j - ti(a 2 + 2a + 2 - x 2 ) (C3) 

- t 2 (l - a 2 - z 2 ) = s , 

where so is a polynomial in x, z and a which is a sum of 
squares, then for any variables satisfying the desired con- 
straints t — (x + 7}Z — 2) > since so is always positive. 
Our goal can thus be rephrased as searching for suitable poly- 
nomials ti and t 2 such that we can rewrite the resulting poly- 
nomial as a sum of squares. Very intuitively, level £ of the 



SDP hierarchy searches for such polynomials up to degree 
2£ by searching for a matrix Qe such that Qe > and for 
Vt = (1, a, x, z, . . .) being the vector of all possible monomi- 
als up to degree £ where we have v\Qgvg = p. To convince 
ourselves, note that this means we search for Qi > such that 



x+-z-l 



(C4) 



+ ti(a 2 + 2a + 2 - x 2 ) + t 2 (l - a 2 - z 2 ) 
and thus for variables satisfying the constraints 

t - (x+^z-l^J = vlQiVi , (C5) 

which is clearly positive. The actual sums of squares poly- 
nomials so can be obtained from Q by diagonalizing Q = 
U^DU where U is unitary and D is a diagonal matrix. Since 
D only has positive entries (Q > 0), we obtain that so = 
dj (Uv)^ (U v)j is indeed a sum of squares. 
It turns out that for case 3, we can already find such a matrix 
Q at level £ = of the SDP, that is, £ 1? t 2 e E are simply 
scalars. To see how this works explicitly, let us first rewrite 
the polynomials above in terms of matrices. Let 

°\ 




°/ 
10 

0-10 
-1 



M 



Mi 



f-2 
l 

! 

4 





(C6) 



(C7) 



Mo 



T 



2 1 
0-100 


10 1 

( 0' 









(C8) 



(C9) 



Clearly, for 



v := (1 x z a) 1 



we have 



(C10) 

(Cll) 

(C12) 
(C13) 

From the numerical solutions obtained by Matlab with Se- 
DuMi JUt] and YALMIP [25], we can guess an analytical so- 
lution given by 





= x-{ 


1 

- — z 
2 


- 2 




= 1 - 


a 2 - 




^M 2 v 


= a 2 


f 2a 


+ 2-x 2 



11 = 0.51 

1 2 = 0.24 

t = 0.368 



(C14) 
(C15) 
(C16) 
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for which we can easily verify that 

Qa := S - M Q - hAh - t 2 M 2 > , (C17) 
which concludes our claim. 

2. Cases 1 and 2 

The bounds for cases 1 and 2 are obtained analogously. The 
only difference is that we have to deal with more variables. 
Again, we first introduce auxiliary variables to eliminate terms 



containing yf-. We then search for suitable polynomials like ti 
and ti above. Unlike for the simple case 3, the desired bounds 
are not obtained at level I = of the hierarchy. However, they 
are already found at level I = 1, and an analytical solution 
can again be extracted. Yet, since at level I = 1 we observe 
polynomials of degree up to 2 in both the original and the 
auxiliary variables (in total 6 for case 2, and 8 for case 1) the 
resulting problem is already rather large (involving matrices 
of size 82 x 82 for case 1). We do not include these matrices 
here, but the Matlab scripts that can be used to extract the 
analytical values are available upon request. 



